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Abstract. We consider the inverse problem of estimating a function u from 
noisy, possibly nonlinear, observations. We adopt a Bayesian approach to the 
problem. This approach has a long history for inversion, dating back to 1970, 
and has, over the last decade, gained importance as a practical tool. However 
most of the existing theory has been developed for Gaussian prior measures. 
Recently Lassas, Saksman and Siltanen (Inv. Prob. Imag. 2009) showed how 
to construct Besov prior measures, based on wavelet expansions with random 
coefficients, and used these prior measures to study linear inverse problems. 
In this paper we build on this development of Besov priors to include the case 
of nonlinear measurements. In doing so a key technical tool, established here, 
is a Fernique-like theorem for Besov measures. This theorem enables us to 
identify appropriate conditions on the forward solution operator which, when 
matched to properties of the prior Besov measure, imply the well-definedness 
and well-posedness of the posterior measure. We then consider the application 
of these results to the inverse problem of finding the diffusion coefficient of an 
elliptic partial differential equation, given noisy measurements of its solution. 

1. Introduction 

The Bayesian approach to inverse problems is an attractive one. It mathematizes 
the way many practitioners incorporate new data into their understanding of a given 
phenomenon; and it results in a precise quantification of uncertainty. Although this 
approach to inverse problems has a long history, starting with the paper |16] . it is 
only in the last decade that its use has become widespread as a computationl tool 
[19]. The theoretical side of the subject, which is the focus of this paper, is far from 
fully developed, with many interesting open questions. The work \16\ concerned 
linear Gaussian problems, and the mathematical underpinnings of such problems 
were laid in the papers |25l 124] . An important theme in subsequent theoretical work 
concerning linear Gaussian problems has been to study the effect of discretization, 
in both the state space and the data space, and to identify approaches which give 
rise to meaningful limits [211 [22]. In many imaging problems, the detection of edges 
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and interfaces is important and such problems are not well modelled by Gaussian 
priors. This has led to two recent works which try to circumvent this issue: the 
paper |23| introduces the notion of Besov priors, based on wavelet expansions, and 
the paper [rf uses heirarchical Gaussian models to create a discretization of the 
Mumford-Shah penalization, in one dimension. The thesis [27] studies a number of 
related issues for quite general measurement models. A different series of papers 
has studied the development of Bayesian inversion with Gaussian priors and non- 
linear measurement functions 7, 30 , in which general criteria for a Bayes theorem, 
interpreted to mean that the posterior distribution has density with respect to the 
prior, are given. This framework has been used to study the effect of approximation 
of both the space in which the prior lies and the forward model [8j [11] , allowing 
the transfer of error estimates for the approximation of the forward problem, into 
estimates for the approximation of the posterior measure. The goal of the present 
paper is to extend this type of approximation theory from Gaussian priors to the 
Besov priors introduced in [23], and in the case of nonlinear measurement functions. 
We consider the noisy nonlinear operator equation 

(1) y^giu) + r, 

with Q : X Y, X,Y Banach spaces and ij a F-valued random variable. We 
suppose in this paper that y, the operator Q and the statistical properties of ry 
are known, and an estimation of u G X is to be found. Such an inverse problem 
appears in many practical situations where the function of interest (here u) cannot 
be observed directly and has to be obtained from other observable quantities and 
through the mathematical model relating them. 

This problem is in general ill-posed and therefore to obtain a reasonable approx- 
imation of u in a stable way, we need prior information about the solution j l5[ 119] . 
In particular if we expect the unknown function to be sparse in some specific or- 
thonormal basis of X, implementing the prior information in a way that respects 
this sparsity will result in more efficient finite-dimensional approximation of the 
solution. For instance a smooth function with a few local irregularities has a more 
sparse expansion in a wavelet basis compared, for example, to a Fourier basis, and 
adopting regularization methods which respect the sparse wavelet expansion of the 
unknown function is of interest in many applications. Approximation techniques 
based on wavelet bases for recovering finite dimensional estimates of the unknown 
function are extensively studied in the approximation theory, signal processsing 
and statistics literature; see for example [U |6] [TS] [M] . The paper i2j introduced 
a nonparametric Bayesian approach to the problem of signal recovery, adopting a 
clever posterior construction tuned to the simple form of the observation operator. 
The paper [23] considers more general linear observation operators and constructs 
the wavelet-based Besov prior; the construction is through a generalization of the 
Karhunen-Loeve expansion to non-Gaussian coefficients and wavelet bases. 

We adopt a Bayesian approach to the above inverse problem, so that regular- 
ization is implicit in the prior measure on X. We study the Besov prior measure 
introduced in |23] as this measure is constructed so that it factors as the product 
of independent measures along members of a wavelet basis, see Section |3] We first 
make sense of Bayes rule in an infinite dimensional setting, and then study the finite 
dimensional approximations after having proved the well-posedness of the posterior 
over an infinite-dimensional Banach space. Bayes rule for functions is here inter- 
preted as follows. We put a prior probability measure /io(du) on u and then specify 
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the distribution of yju via ([T]), thereby defining a joint probability distribution on 
{u,y). If the posterior measure ^^{Au) = V{Au\y) is absolutely continuous with re- 
spect to the prior measure ijlq{(1u) — P(du) then Bayes theorem is interpreted as 
the following formula for the Radon-Nikodym derivative: 



Here $ depends on the the specific instance of the data the forward (or observa- 
tion) operator Q and the distribution of 77 (see Section [3]). For simplicity we work 
in the case where X comprises periodic functions on the d— dimensional torus T''; 
generalizations are possible. 

The problem of making sense of Bayes rule for nonlinear observation operators, 
and with a Gaussian prior, is addressed in [T', '30[; sufficient conditions on <i> and the 
Gaussian prior /iq which imply the well-definedness and well-posedness of the pos- 
terior are obtained. Key to obtaining the conditions on $ is the Fernique theorem 
for Gaussian measures which gives an upper bound on the growth of /zo-integrable 
functions. Our aim here is to generalize the results of [71 [30] to the case of Besov 
prior measures and so we need a similar Fernique-like result for the Besov measures. 
In Section [2l following 'SS' , we construct the Besov measures using wavelet expan- 
sions with i.i.d. random coefficients (Karhunen-Loeve expansions) of their draws, 
and prove a Fernique-like result for these measures (Theorem 12. ip . We then use 
this result in Section [3] to find the conditions on the operator Q which ensure the 
well-definedness (Theorem 13. 2p and well-posedness (Theorem 13. 3p of the posterior 
measure /i^, provided that the Besov measure /io is chosen appropriately. We then 
build on this theory to quantify finite dimensional approximation of the posterior 
measure, culminating in Theorem 13.41 

In Section m we apply these results to the inverse problem of finding the diffusion 
coefficient of a linear uniformly elliptic partial differential equation in divergence 
form, in a bounded domain in dimension d < 3, from measurements of the solution 
in the interior. Such an inverse problem emerges in geophysical applications where 
u is the log-permeability of the subsurface. It is studied using a Bayesian approach 
on function space in for Gaussian priors and in |28j for non-Gaussian priors 
which give rise to an almost sure uniform lower bound on the permeability. In 
many subsurface applications it is natural to expect that m is a smooth function 
with a few local irregularities, and it is not natural to expect an almost sure uniform 
lower bound on the permeability, nor is a Gaussian prior appropriate. For these 
reasons a wavelet basis, and hence a Besov prior, provides a plausible candidate 
from a modelling perspective. We show that the conditions we require for $ in 
this problem hold naturally when X is the space of Holder continuous functions C* . 
This also suggests the use of wavelet bases which are unconditional bases for C*. 
Thus for the elliptic inverse problem with Besov priors we state a well-posedness 
result for the posterior measure (Theorem 14.11) and we also quantify the effect of 
the finite dimensional approximation of the log-permiablility in the wavelet basis 
used in construction of the prior, on the posterior measure, with result summarized 
in Theorem |4?2] 



(2) 
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2. Besov measures 

We develop Besov measures following the construction in [23]. Let {'>pi}^i be a 
basis for L'^(T'^), = (0, 1]'', d < 3, so that any / e L'^{T'^) can be written as 

OO 

1=1 

Let X'^''^ be a Banach space with norm || • \\x=,i defined as 

/ oo \ 1/9 

with q > 1 and s > 0. We now construct a probability measure on functions by 
randomizing the coefficients of an expansion in the basis {"iAilz^i- The space X'^''^ 
will play a role analogous to the Cameron-Martin space for this measure. Indeed 
when {V'ili^i is chosen to be the Karhunen-Loeve basis for a Gaussian measure 
and g = 2, our choice of coefficients will ensure that X"'^ is precisely the Cameron- 
Martin space. 

Let I < q < oo, s > 0, and k > be fixed and {£,i}1^i be real- valued i.i.d. 
random variables with probability density function 

7r^(a;) oc exp(-i|a;|«). 
Let the random function u be defined as follows 

oo 

(3) u(a;)-^r(^ + ^-i)(i)l6^;(a:). 

1=1 

We will refer to the induced measure on functions u as fj,Q. We note that, since 
{tpi}fZi is orthonormal basis and 

oo 

u{x) = ^ uiipi{x) 

1 = 1 



with = / ^ d + 2 <j)(^i) -vve have 

oc oo 

nexp(--i6r)=n'^^p 




1=1 1=1 

= exp 

(4) =exp( 2, 

Thus, informally, u has a Lebesgue density proportional to cxp(— -I We 
say that u is distributed according to an X'^''^ measure with parameter k, or, briefiy, 
a measure. 

Remark 1. If {ipi}fZi O is an r-regular wavelet basis0 for L^(T'^), with r > s, 
then II • ||x=.<j is the Besov B^^ [31] norm and u is distributed according to a (k, B^^) 



^An r- regular wavelet basis for L^{K'^) is a wavelet basis with r-regular scaling function and 
mother wavelets. A function / is r-regular if / € C" and \d°'f{x)\ < Cm(l + for ^'^J 

integer m G N and any multi-index a with |a| = oi -(-■•• + o^j < r. An r-regular basis for L^{T'^) 
is obtained by periodification of the r-regular wavelets of L-^{W^) | 12II26| . 
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measure. Furthermore, ii q — 2 with {ipi}f^i either a wavelet or Fourier basis, we 
obtain a Gaussian measure with Cameron-Martin space i?|2' which is simply the 
Hilbert space 

fjs ^ H'^iT'). Indeed © reduces to 

u{x) = Y - X!^"^ ^' 
1=1 

where {^i}iZi are independent and identically distributed mean zero, unit variance 
Gaussian random variables. This is simply the Karhunen-Loeve representation of 
draws from a mean zero Gaussian measure. 

The following result shows that the random variable u is well-defined and char- 
acterizes apsects of its regularity. 

Proposition 1. j23| Let u he distributed according to a (k, X'*''?) measure. The 
following are equivalent 

i) ||u||xt.<! < oo almost surely. 

ii) E(exp(a||u||^t,g)) < c» for any a G (0,k/2); 
Hi) t < s — d/q. 

Part (ii) of the above proposition provides a Fernique-like result 10^ for Besov 
priors. Indeed for g = 2, this implies the Fernique theorem for all spaces , t < s—^ 
on which the measure is supported. The Fernique result for Gaussian measures, 
however, is much stronger: it shows that if u G X almost surely with respect to 
the Gaussian measure /i then E'' exp(e||M||^) < oo for e small enough. Theorem 
12.11 below goes some way towards showing a similar result for Besov measures by 
extending (ii) to C* spaces (see Remark[2]) ■ Theorem l2.1l will also be useful in Section 
|3] when we deal with inverse problems, in the sense that it allows less restrictive 
conditions on the prior measure. 

Before proving Theorem 12.11 we make a preliminary observation concerning the 
regularity of u given by ^ in C* spaces. With the same conditions on s and t as 
are assumed in Proposition [T] one can show that E||u||c't(T<i) < oo. Indeed, for any 
7 > 1 , and u given by ([3]) , using the definition of the Besov norm we can write 

\\u\\l\ 

-fq,-iq 

Noting that E|^/|^^ — C{^) and the exponent of I is smaher than —1 (since t < 
s — d/p), we have 

oo 

mu\\l\ =C(7)(i)^5]Z^(*-^)+^-^ < Ci(7). 

1=1 

Now for a given t < s — d/q, choose 7 large enough so that ^ < s — d/q — t. Then 
the embedding Bt/g .yg C C* for any ti satisfying t + < ti < s — d/q [SI] implies 
that E||m II pt(Td) < 00 and hence that u ^ /ig— almost surely. In the following 
theorem we show that, for small enough a, Eexp(a||u||(7t(Tti)) < 00. For the proof 
we use the idea of the proof of a similar result for Radamacher series which appears 
in Kahane . It is also key in this proof that the wavelet basis is an unconditional 
basis for Holder spaces [32]. 



00 

= (i)7^^^+¥-ir^9(i+^-i)|^;P9. 
1=1 
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Theorem 2.1. Let u be a random function defined as in 0) with q> 1 and s > d/q. 
Then for any t < s — d/q 

E(exp(Q;||u||c'')) < °o 
for all a £ (0, K/(2r*)), with r* a constant depending on q, d, s and t. 

Proof. First, let k = 1. We have (Mj 

with A; = Note that, as shown above, ||u||c' < oo, /iQ-almost surely. 

Fix r > and let 

A = {w e f7 : supA/|C;(a;)| > r}, 

B = [uj ^VL: supA/|^/(w)| > 2r}. 

len 

Consider the following disjoint partition of A 

A„i = {w e : sup A/|^;| < r, A„|^„| > r}, 
i<;<m-i 

and define 

= {w e 17 : supA/|^,| > 2r}. 

/>rn 

We have 

P(A„nB) =F(A™nB,„). 

Noting that 

Byn = U Bra+l with Cra = {\ra\£.m\ > 2r}, 

and 

A,n = Dm n E.m, with An = {ui E ft : sup A;|^;| < r}, 

l<i<m-l 
Em = {UJ e^l ■■ Xm\Cm\ > r} , 

we can write 

P(A„ n B) < P{Am nCm)+ nAm H 

= PiDm n C™) + P(A™)P(B™+i) 

(6) = P(D™)P(C„) + FiAmWiBm+i). 

We now show that P(C„i) < (P(£^„i))^ for large enough r. First let q > 1. We have, 
with f = r/Xm, Co = /r, exp(-i|a;|9) dx and q = 27r''/Vr(d/2), 

Co JB<i 
/>oo 

Cd / _d -io«J- 
= — / p^e 2^ dp 

Co J2f 

^2-^+1^ / p'^e-^^'dp 

Co Jr 

Co Jf 

Co Jr 

(7) =2'^+ie-^^'P(i?„0- 
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Then one can show that < F{E„i) for sufficiently large f. Indeed there 

exists Cq^d depending only on q and d such that for p > f = Cq^d and < 2' — 2 we 
have 

(8) 2^^+100^^ P^-^e-'''''/^ 

2cd 



Therefore we can write 



Co Jf 



/e-3P'dp 



>2'i+i/ !^p9-ie-(-+i)pV2dp 
Hence going back to Q we have 

(9) P(c„o < (P(£;„))2. 

For g = 1 we can calculate P{E„i) and P(C„i) as follows 

P(i?„) = ^^2'=+V'^-^(^)c-5, and P(C„0 = - ^ 2'^' (20'-' 

^° fc=0 ^ fc=0 ^ 

where Cd/co — 2'^ when g = 1. This readily shows (|9]) for the case of q = 1 as well. 
Substituting © in dH]) we get 

P(^m n < P(A«)(P(£^m))' + P(^™)P(B™+l) 

= P(i?„ n E^)¥{E„,) + P(A„)P(B„,+i) 

= p(yi„)P(£;™) + P(A„)iP(Sm+i). 

This, since i?m C A and -Bm+i ^ implies that 

P(A„. ns) < 2P(A„)P(A). 

Writting the above inequality for m = 1, 2, . . . and then adding them we obtain 

(10) P(B) =P(AnB) < 2(P(A))^ 

for r > Cq^d (note that f = r/Xm > Cq and Am < 1). 
Let ^(p) = P(||m||c* > p)- We have 

e«^ll"llc* ^o(du) = I e^ll"(")llc' p(da;) = - /" e^^d^(p). 
c*(T'') Jn Jo 

Let ^(r) = P(A) = l3 and note that by ^ we have 

^(r) = 1(2/3), ^(2r) < ^(2/3)^, . . . , ^(2"r) < 1(2/3)^", 



and therefore, choosing r large enough so that /3 < ^ 

+V 

e^^d^(p) < e2"+'^''^(2"r) = ^(2/3)"'" e 
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Hence, letting /? = 1/4, we can write 

- / e'P d^(p) < - / e'P d^(p) + V T^(2)-'" e" 

Jo Jo n ^ 



n=0 
oo 

=^Prl ^Bl^n^ 4- ^ _e-2"0n2-2er) 
n=0 ^ 

which is finite for e < This proves the desired result for k = 1, once we can 
identify the lower bound on r; a simple rescaling gives the result for general n. 

We need to choose r large enough so that P(yl) = 1/4 and ^ holds true as well. 
We have 

F(^) < ^ P{cj : (w) I > r} < — ^ / x'^e'^/^ 



— 1 A;— ■ 

- y 2'=+! ( /°° exp(-ix^-i) da; 

01 t^^ \kJ Jr 2 



d 

< 

Coi 



with coi = c^Vjjd e-l^l/^dx and noting that Ai = 1 and A^ = j(t~s)/d+i/q ^ ^^-^ 
choose ri — ri{s,t,d,q) such that 

/•oo ^ ^ 1 t 1 C 1 

/ a;'''^"'^'"-"^^^) exp( — x~^~^)dx<—-^ — -r-, for k — 0,...,d, 

Jr, 2 ' 4(d+l)2'=+i(f) 

and therefore ¥{A) < 1/4. To have (|8]) true as well, we set 

- 0, and c,,d = (2'^g co/c<j)i/('^-''+i) , for 1 < g < 1 + 3/4 
= 1, and Cg^rf = max{l, 2''+^gco/cd}, for q > 1 + 3/4. 

Since for n ^ 1, e — a/K, this implies that a < K/{2r*) with 

(11) r* = (ln2) max{ri,Cg^d}. 

□ 

Remark 2. Note that the bound on a in Theorem 12.11 is not sharp. Also, under 
the same condition as in Theorem 12.11 it is natural to expect a similar result to 
hold with power q of ||u||(7t in the exponent and for the result to extend to a norm 
in any space which has full measure; it would then be consistent with the Gaussian 
Fernique theorem that arises when q — 2 (see Theorem 2.6 in [10^). However we 
have not been able to prove the result with this level of generality. 



3. Bayesian approach to inverse problems for functions 

Recall the probabilistic inverse problem we introduced in Section [1] find the 
posterior distribution of u G X, given a prior distribution /xq of u € X, and 
y G Y given by ^ for a single realization of the random variable rj. We denote the 
distribution of 77 on K by Qo^dy). By ([T]) the distribution of y given u is known as 
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well; we denote it by Q"(d?/) and, provided is absolutely continuous with respect 
to Qoj we may define <!> : X x F ^ R so that 

(12) ^(y)^cxp(-<i>(u;y)), 
and 

(13) ^exp(-$(w;y))Qo(dy) = l. 

For instance if 77 is a mean zero random Gaussian field on Y with Cameron-Martin 
space [E, (•, ■)e, || ■ \\e) then the Cameron-Martin formula (Proposition 2.24 in [lOj ) 
gives 

(14) <i>{u;y)^^\\g{u)\\l-{y,giu))E. 

For finite-dimensional Y ~ M^ , when rj has Lebesgue density p, then we have the 
identity exp(-$(-u; y)) = p{y - Q{u))/p{y). 

The previous section shows how to use wavelet or Fourier bases to construct 
probability measures ^0 which are supported on a given Besov space i?*^ (and 
consequently on the Holder space C*). Here we show how use of such priors p^ may 
be combined with properties of defined above, to deduce the existence of a well- 
posed Bayesian inverse problem. To this end we assume the following conditions on 

Assumption 3.1. Let X and Y be Banach spaces. The function <f> : X x F M 
satisfies: 

(i) there is an ai > and for every r > 0, an M G R, such that for all u e X, 
and for all y € 1" such that \\y\\Y < r, 

$(u,y) > M ~ai\\u\\x; 

(ii) for every r > there exists K — K{r) > such that for all u & X, y ^ Y 
with max{||u||x, Hyllr} < r 

$(M,y) < K; 

(iii) for every r > there exists L = L{r) > such that for all ui,U2 G X and 
y eY with max{||Mi||x, ||u2|U, WyWv} < r 

|$(ui,y)-$(u2,y)| < L\\ui-U2\\x; 

(iv) there is an 0:2 > and for every r > a C G K such that for all yi , j/2 G Y 
with max{||yi||y, ||j/2||y} < 1" and for every u G X 

|$(u,yi) -$(u,y2)| < exp(a2||u|U -HC)||yi -y2||. 

3.1. Well-defined and well-posed Bayesian inverse problems. RecaU the 
notation no for the Besov prior measure defined hy^ and for the resulting pos- 
terior measure. We now prove well-definedness and well-poscdness of the posterior 
measure. The following theorems generalize the results of ,30j from the case of 
Gaussian priors to Besov priors. 

Theorem 3.2. Let <i> satisfy US]) and Assumvtion \3.1\f i]-(in). Suppose that for 
some t < 00, C* is continuously embedded in X . There exists k* > such that if Pq 
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is a measure with s > t + ^ and k > n* , then fi^ is absolutely continuous 

with respect to fio and satisfies 

rli/y 1 

(15) 

with the normalizing factor Z{y) — exp(— y)) /io(dM) < co. The constant 
K* = 2Cer* ai, where Cg is the embedding constant satisfying \\u\\x < Ce||M||c*; o,'>^d 
r* is as in hll]) . 

Proof. Define 7ro(du,d?;) = /io(du) ® Qoldy) and 7r(du,dj/) ~ ^o(du)Q'"(dy). As- 
sumption [3lljiii) gives continuity of $ on X and since ^iq{X) = 1 we have that 
$ : X R is /io-measurable. Therefore tt <^C ttq and tt has Radon-Nikodym 
derivative given by (fT^ as (noting that by and since ^iq{X) — 1, we have 
/_^^^exp(-$(u;y)) 7ro(d-u,dy) = 1) 

4— (";y) = exp(-$(u;?/)). 

This then by Lemma 5.3 of [TH], imphes that ^^{du) — 7r(du, d^|^ = y) is absolutely 
continuous with respect to /io(y) — 7ro(dM, d^|^ = y), since ttq is an independent 
product. This same lemma also gives p5|) provided that the normalization constant 
is positive, which we now establish. We note that all integrals over X may be 
replaced by integrals over X*''^ for any t < s ~ ^ since noiX*''^) — 1. First by 
Assumption 13 . 1 f i) note that there is M = M{y) such that 

Z{y)^ exp(-$(w;y))d^o(u) 
Jxt-i 

< / exp{ai\\u\\x - M)dfio{u) 
Jx*i 



< / exp{ai Ce\\u\\ct - M)d^oiu) 

Jx^'i 

This upper bound is finite by Theorem 12.11 since k, > 2cer*ai. We now prove 
that the normalisation constant does not vanish. Let R = E\\u\\xt,q noting that 
R E (0, oo) since t < s — -. As ||M||jft,g is a nonnegative random variable we have 
that /io(ll"l|x'.<! < i?) > 0. Taking r = max{||y|jy, i?}, Assumption 13 . 1 f ii) gives 

Z{y)^ exp{-<i>{u;y))dfio{u) 

Jxt.<i 



> / ex-p{—K)diio{u) 

= exp(-i^)^o(||"IU'.<! < R) 
which is positive. □ 

We now show the well-posedness of the posterior measure /i*' with respect to the 
data y. Recall that the Hellinger metric c?hgii is defined by 



dHcll(/i; Ai') = 



\ 




The Hellinger metric is independent of the choice of reference measure v, the mea- 
sure with respect to which both /i and /i' are absolutely continuous. The posterior 
measure is Lipschitz with respect to data y, in this metric. 
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Theorem 3.3. Let $ satisfy U3]} and Assumption \3.lV i)-(iv). Suppose that for 
some t < oo, C* is continuously embedded in X. There exists k* > such that if 
/iQ is a {k, X^'"^) measure with s > t + ^ and n > k* then 

duciiif^^ , ^J.^ ) <C\\y- y\\Y 

where C — C{r) with max{||?;||y, ||y'||y} < r. The constant k* — 2ce r*{ai + 2a2), 
where Cg is the embedding constant satisfying \\u\\x < Cellullc"*, and r* is as in ill]) . 

Proof. As in Theorem I3.2[ Z(y), Z{y') G (0, oo). An application of the mean value 
theorem along with Assumption 13 . 1 f i) . and (iv) gives 

\Z{y)~Z{y')\< [ \eM-Hu;y))-expi-^u;y'))\dfio{u) 



< / exp(ai||u|U - M)mu;y) - '^{u;y')\ d^u) 

< / exp{{ai + a2)\\u\\x - M + C)\\y-y'\\Ydfio{u) 
(16) <C\\y^y'\\Y, 

since < Ce||u||c't and Ce{ai + < K/{2r*). Using the inequality (a + b)^ < 

2(a2 + 62) 

2dHcii = ^^ (^Z{y)-i exp(-^'P{u:y)-{Z{y')-i exp{~^'l>{u;y')^ d^oH 



<h+I: 



2 



2 /■ / . 1 . , .1 ^ ^ 



where 

h = 2\Z{y)--^ ~ Z{y')--'f f expi~<f{u;y'))dfio{n) 

^2\Ziy)-^ -Ziy'r^fZiy'). 

Again, an application of the mean value theorem, and use of Assumptions I3.1f i) 
and (iv), gives 

^h< f ]exp{ai\\u\\x-M)expi2a2\\u\\x + 2C)\\y-y'\\\l^io{u) 

^ Jx*.<i 4 

<C\\y-y'\\l; 

since Ce(ai + 2a2) < K,/{2r*). Recall that Z{y) and Z{y') are positive and bounded 
from above. Thus by the mean value theorem and ()16p 

h = 2Z{y')\Z{yr'^ - ZiyT^"" < C\Z{y) - Z{y')\' < C\\y ~ 
The result follows. □ 

3.2. Approximation of the posterior. Consider $^ to be an approximation 
of $. Here we state a result which quantifies the effect of this approximation in the 
posterior measure in terms of the aproximation error in $. 
Define ^i^'^ by 

(17a) ^e.p(-*~(„)). 
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(17b) = / e^p{~^^{u))d^,o{u). 

Jx 

We suppress the dependence of $ and $^ on y in this section as it is considered 
fixed. 

Theorem 3.4. Assume that the measures fj, and fi^ are both absolutely continuous 
with respect to fiQ, and given by il5\) and ^17\ l respectively. Suppose that $ and $^ 
satisfy Assumvtion \3. lY i) and (ii), uniformly in N , and that there exist as > and 
C e M such that 

\<^{u) - < exp(a3||«||x + C)i^{N) 

where ip{N) as N ^ oo. Suppose that for some t < co, C* is continuously 
embedded in X . Let /iq be a (k, X^''^) measure with s > t + | and k > 2ce r*{ai + 
2a3) where r* is as in ill]) and Cg is the embedding constant satisfying \\u\\x < 
Ce 11^11(7* • Then there exists a constant independent of N such that 

The proof is very similar to the proof of Theorem 13.31 and, in the Gaussian case, 
is given in [30 ; hence we omit it. 

4. Application to an elliptic inverse problem 
We consider the ehiptic equation 

(18) -V • (e"(^)Vp(x)) = / + V -5, xeT'^, 

with periodic boundary conditions and with T'^ = (0, 1]'', d < 3, p, u and / scalar 
functions and g a vector function on T''. Given any u e L°°{T'^) we define X{u) and 
A{u) by 

A(u) = ess inf e"(^\ A(u) = ess sup e"(="). 

Where it causes no confusion we will simply write A or A. Equation (jl8p arises 
as a model for flow in a porous medium with p the pressure (or the head) and 
e" the permeability (or the transmissivity); the velocity v is given by the formula 
V oc — e"Vp. 

Consider making noisy pointwise observations of the pressure field p. We write 
the observations as 

(19) = pix,) + 7jj, x.eT" j = 1, • • • , K. 

We assume, for simplicity, that rj — {rij}^^^^ is a mean zero Gaussian with covariance 
r. Our objective is to determine u from y = {y^jj^i G ■ Concatenating the 
data, we have 

y = g{u) + T], 

with 

(20) g{u) = {p{xi),--- ,p{xK)f. 

In order to apply Theorem 13. 2[ 13.31 and 13.41 to the elliptic inverse problem we 
need to prove certain properties of the forward operator Q given by (|20p . viewed 
as a mapping from a Banach space X into R™. The space X must be chosen so 
that C* is continuously embedded into X and then the Besov prior /xq chosen with 
s > t + - . In this section | • | stands for the Euclidean norm. The following result is 
proved in [TT| . 
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Proposition 2. Let f e L''{T'^), g € L'^''{T'^). Then for any u e L°°{T'^) there 
exists C = C{K,d,r, llsllisr) such that 

\g{u)\ < C exp(||u||ioo(£,)). 
// ui,U2 G C^{D) for any t > Q. Then, for any e > 0, 

\Q{ui) - Q{u2)\ < C exp (cmax{||ui||ct(D), ||"2||c*(£')}) ll"i " U2\\l^(d)- 
with C = C{K, d, t, e, \\g\\L2r) and c = 4 + (4 + 2d) /t + e. 

Note that if instead of pointwise measurements of p, we consider ttie observations 
to be of tlie form {li{p), ■ ■ ■ , Ik{p)) where Ij : — )■ R, j = 1, . . . , K, are bounded 
hnear functionals, then one can get similar boundedness and continuity properties 
of Q assuming u to be only essentially bounded on T'^: Holder continuity is not 
needed. However, since we construct the prior using a countable orthonormal 
basis {?/'/};gn, requiring the draws of /xq to be bounded in L°°(T'^) results, in any 
case, in more regular draws which lie in a Holder space. This is because L°°(T'^) is 
not separable but any draw from /xq can be expanded in {■;/); }/gp}. It is thus natural 
to consider the case of pointwise measurements since very similar arguments will 
also deal with the case of measurements which are linear functionals on . 

4.1. Well-definedness and continuity of the posterior measure. Now 
we can show the well-definedness of the posterior measure and its continuity with 
respect to the data for the elliptic problem. As we noted in Remark [1] by choosing 
{■)/'; of ([3]) as a wavelet or Fourier basis we can construct a Besov (k, S^^) or a 
Gaussian {k,H^) prior measure {B22 = H'^). We have the following theorem: 

Theorem 4.1. Consider the inverse problem for finding u from noisy observations 
ofp in the form of ^^ and with p solving Let f G L''(T''), g G L^r^^d) 

consider to be distributed as a Besov {k, B'^^) prior with I < q < 00, s > d/q, 
K > for q = 2 and k > 4r* for q ^ 2 and r* as in ill]) . Then the measure 
fi^^du) is absolutely continuous with respect to fiQ with Radon-Nikodym derivative 
satisfying 



Furthermore, the posterior measure is continuous in the Hellinger metric with re- 
spect to the data 



satisfies (|13l) and Assumption I3.1f i) with M = cr^, c depending on F, and ai = 0. 
Using Proposition [21 Assumption I3.1f ii) and (iii) follow easily. By Theorem 12. 1[ 
IJ.q{C*{D)) = 1 for any t such that t < s — d/q and the absolute continuity of pL^ 
with respect to follows by Theorem 13.21 
We note that 




d^,n{py,iiy)<C\y-y'\. 
Proof. Let i < s - d/q and X = C*(T'*). The function 





< ciexp(||u||x)||/i - I/2I 
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Hence Assumption 13. II (iv) holds, and noting that a2 = 1, the continuity of /j.^ with 
respect to the data follows from Theorem 13.31 □ 



4.2. Approximating the posterior measure. In this section, we consider the 
approximation of a sufficiently regular m in a finite-dimensional subspace of L^(T'^) 
and use the Lipschitz continuity of Q together with Theorem 13.41 to find an error 
estimate for the corresponding approximate posterior measure. 

Let {i/jij/gN be an orthonormal basis of L^(T'') and W'^ — span{^/;i, . . . ,"0^}- 
Denote the orthogonal projection of L^(T'^) onto W'^ by and let CJ^ = Q{P^u). 
Define the approximated posterior measure fi^'^ by 

with Z'^ the normalizing factor. 

Now we write u £ i^(T'') in a wavelet basis: 

oo 

(22) u{x) ^ Ul(j){x) ^ Um.kll'mM^)- 

In the above equation (j> is the scaling function for L^{T'^), k — (fci, . . . , kd), Aj = 
{1, . . . , 2^^ - 1} X {0, . . . , 2^ - I}'*, and for each fixed j, 

where ipm are the mother wavelet functions for L^(M'')(sec Chapter 3 of [26 J. We 
also assume that the above wavelet basis is r-regular, with r sufficiently large (see 
Remark [T]) . 

We now impose one-dimensional indexing on the basis by setting = (j) and 
using the following numbering |23[ I26j for — ipm.k, I > 1, 



forj^O: l^2,...,2'^, 
forj = l: I = 2"^ + 1,...,2^'^, 



With this notation, the Karhunen-Loeve expansion of a function u drawn from a 
(k, i?^^)-Besov prior is the same as ([3]) and therefore the measure ^^'^ is an 
approximation to /i^ found by truncating the Karhunen-Loeve expansion of the 
prior measure to N terms using the orthogonal projection defined above. 
We have the following result on the convergence of fi^'^ to fj.^ a.s N ^ oo: 

Theorem 4.2. Consider the inverse problem of finding u £ C*(T'^), with t > 
0, from noisy observations of p in the form of 119\) and with p solving U8]} with 
periodic boundary conditions. Assume that the prior fiQ is a (k, B^^) measure with 
s > d/q + t, K > for q = 2 and k > 8r*(2 + (2 + d)/t) otherwise. Then 

We note that, although for a fixed N the rate of convergence of approximated 
posterior measure to /j,^ in the wavelet case is smaller than that of the Fourier 
case, where dudiifJ'^ , tJ'^''^) < C N~* (see [H]), one should take into account that 
we expect that the functions that solve the elliptic inverse problem of this section. 
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have a more sparse expansion in a wavelet basis compared to the Fourier basis (see 
also section 9.4 of [T^] or section 3.11 of ^5]). 

Proof of Theorera \4-2\ Let Vq and Wj be the spaces spanned by {0} and {'0m,fe}(m,fe)eAj 
respectively. Consider Qj to be the orthogonal projection in L^(T'') onto Wj, and 
Pj the orthogonal projection of L'^ onto ®^fZ}-^Wk © Vo- For any / G C*(T'') we can 
write [321 Proposition 9.5 and 9.6] 

\\f-P,f\\L^<C sup <C2-^*||/||c*. 

0<\x~y\<2-i 

Here and in the rest of this proof we represent any constant independent of / and 
j by C . Using the above inequality, we have 

\\Q,f\\L^ = \\P, + lf-P-jf\\L^ 

<\\f-Pjf\\L^ + \\f-P, + lf\\L^ < C2-^i/||c*. 

Hence 



oo oo 

<C\\u\\c. 2-^"* = C|ly|lc*2-('^+i)*^2-^-* 

j=J+l j=0 

< C||u||ct2-(-'+i)* < C 

By Proposition [2] we have 

mu) - $(P^7.)| < Cexp (ci Mc^iD)) N-'l", 
with ci > 4 + (4 + 2d)/t. The result therefore follows by Theorem [Ql □ 

Remark 3. Let be the orthogonal complement of in L^(T'*). Since /io 

is defined by the Karuhnen-Loeve expansion of its draws as in Q using 

it factors as the product of two measures ® on W'^ ® . Since Q'^ (u) = 

Q{P^u) depends only on P^u, we may factor ^^'^ as ^^'^ — where 

satisfies 

(23) ^{u) = ^ exp(-i|r-V2(y _ ^A'(^))|2) 

and = Mo"- With this definition of /i^ as a measure on the finite dimensional 
space and having the result of Theorem 14.21 one can estimate the following 
weak errors (see Theorem 2.6 of pT|): 

||E^''p-E^"p^|U^(T.) <C7V-*/^ 

IIE^" {p-p)^{p-p)- E^" (p^ ~ p^) ® (p^ - p^) lis < C N-'/'', 

with the solution of (HI]) for u = P^m, p = E^''' p, p^ = E^"p^ and 5 = 
C{H^{T'^),H^{T'^)). 
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5. Conclusion 

We used a Bayesian approach [T^ to find a well-posed probabilistic formulation of 
the solution to the inverse problem of finding a function u from noisy measurements 
of a known function Q of u. The philosophy underlying this approach is that formu- 
lation of the problem on function space leads to greater insight concerning both the 
structure of the problem, and the development of effective algorithms to probe it. In 
particular it leads to the formulation of problems and algorithms which are robust 
under mesh- refinement |30| . Motivated by the sparsity promoting features of the 
wavelet bases for many classes of functions appearing in applications, we studied 
the use of the Besov priors introduced in [23] within the Bayesian formalism. 

Our main goal has been to generalize the results of [30| on well-definedness and 
well-posedness of the posterior measure for the Gaussian priors, to the case of Besov 
priors. We showed that if the operator Q satisfies certain regularity conditions on 
the Banach space X, then provided that the Besov prior is chosen appropriately, 
the posterior measure over X is well-defined and well-posed (Theorems l3.2l and l3.3p . 
Using the well-posedness of the posterior on the infinite-dimensional space X, we 
then studied the convergence of the appropriate finite-dimensional approximations 
of the posterior. In finding the required conditions onQ, it is essential to know which 
functions of u have finite integral with respect to the Besov prior /xq. In other words 
we need a result similar to the Fernique theorem from Gaussian measures, for the 
Besov case. A Fernique-like result for Holder norms is proved in Theorem 12.11 and 
may be of independent interest. 

As an application of these results, we have considered the problem of finding the 
diffusion coefficient of an elliptic partial differential equation from noisy measure- 
ments of its solution. We have found the conditions on the Besov prior which make 
the Bayesian formalism well-posed for this problem. We have also considered the 
approximation of the posterior measure on a finite-dimensional space spanned by 
finite number of elements of the same wavelet basis used in constructing the prior 
measure, and quantified the error incurred by such an approximation. 

A question left open by the analysis in this paper is how to extract information 
from the posterior measure. Typical information desired in applications involves the 
computation of expectations with respect to the posterior. A natural approach to 
this is through the use of Markov chain- Monte Carlo (MCMC). For Gaussian priors 
there has been considerable recent effort to develop new MCMC methods which 
are discretization invariant |3l |9] in that they are well-defined in the infinite dimen- 
sional limit; it would be interesting to extend this methodology to Besov priors. In 
the meantime the analysis of standard Random Walk and Langevin algorithms in 
[4] applies to the posterior meausures constructed in this paper and quantifies the 
increase in computational cost incurred as dimension increases, resulting from the 
fact that the infinite dimensional limit is not defined for these standard algorithms. 
A second approach to integration in high dimensions is via polynomial chaos ap- 
proximation [29. and a recent application of this approach to an inverse problem 
may be found in [28]. A third approach is the use of quasi- Monte Carlo methods; 
see [5] . It would be of interest to study the application of all of these methodologies 
to instances of the Besov-prior inverse problems constructed in this paper. 
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